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Abstract 



Motivated by a simple broadcast channel, we generalize the notions of a less noisy receiver 
and a more capable receiver to an essentially less noisy receiver and an essentially more capable 
receiver respectively. We establish the capacity regions of these classes by borrowing on existing 
techniques to obtain the characterization of the capacity region for certain new and interesting 
classes of broadcast channels. We also establish the relationships between the new classes and 
the existing classes. 

1 Introduction 

This paper is motivated directly by a simple broadcast channel setting, posed by Andrea Mon- 
tanari(see Figure [pj, consisting of a BSC(p) and BEC(e). Clearly if e < 2p, then the channel is 
degraded^ and the capacity [2J [3] is given by the union of rate pairs R 2 ) satisfying 

Ri < KU-Yi) 
R 1 + R 2 <I(U;Y 1 )+I(X;Y 2 \U) 

over all (U, X) such that U — > X — > (Yi, Y 2 ) form a Markov chain. 



X 



Figure 1: A broadcast channel consisting of a BSC(p) and BEC(e) 



If 1 — H{p) < 1 — e then Y 2 would be a more capable^ receiver than Y± (see Parts 1,2 of Claim 
[3] in the Appendix) and in this case capacity[5] is given by the union of rate pairs satisfying 

Ri <I(U; Yi) 
R 1 + R 2 <I(U;Y 1 ) + I{X;Y 2 \U) 
Ri+R 2 <I{X ] Y l ) 
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over all (U,X) such that U — > X — ► (Yi, Y2) form a Markov chain. Therefore, the interesting case 
for determining the capacity region occurs when 1 — H(p) > 1 — e. 

Hence we restrict ourselves to the case when 1 — H(p) > 1 — e, i.e. when the channel X — > Y\ 
has a higher capacity than the channel X — ► Y 2 . The Figure [2] plots I{X\Y\) — I{X]Y 2 ) for the 
case p = 0.1, e = 0.5. It is clear that neither is more capable than the other. In particular this 
setting does not fall into any class of broadcast channels for which the capacity region has been 
characterized. We address this regime and establish the capacity region. 

In fact we establish the capacity region of a whole new class of broadcast channels (motivated 
of course by this example) that contains this broadcast channel, under the regime 1 — H (p) > 1 — e, 
as a special case. 




Figure 2: The function I{X\Yx) - I(X;Y 2 ) for BSC(O.l) and BEC(0.5) 



1.1 Observation 

The common theme between degraded, less noisy, and more capable channels is the existence of a 
dominant receiver who manages to decode the private messages for both the users. Let us recall 
the definition of the less noisy^ receiver. One requires I(U;Y 2 ) < I(U; Yi) to hold true for every 
p(u,x) to classify receiver Y\ as a less noisy receiver than Y 2 . In this paper we remove the need for 
I(U ; Y2) < I{U; Y\) holding true for every p(u, x) and replace it by I(U ; Y 2 ) < I(U ; Yi) holding true 
for a sufficiently large class of distributions p(u,x). We then show that for this relaxed definition 
of a less noisy receiver as well, the capacity regions can be obtained. 

A similar development has been done for a more capable receiver as well. 

Remark 1 . The key contribution of this paper is the identification of capacity regions for interesting 
classes of broadcast channels by proving that these channels belong to a slightly tweaked definition 
of a less noisy or a more capable receiver. Secondly, the tweaking of the definitions is done in such 
a way that the existing techniques are sufficient to establish the capacity regions. 

The organization of the paper is as follows. In section [2] we make the formal definitions and set 
up the required notation. In Section [3] we establish the capacity region of a class of two-receiver 
broadcast channels that has one receiver who is essentially less noisy when compared to the other. 
In Section |4] we identify certain interesting classes of channels that are neither less noisy nor more 
capable but are essentially less noisy. In Section [5] we establish the capacity region of a class of 
two-receiver broadcast channels that has one receiver who is essentially more capable than the 
other. Finally in Section [6] we establish the various inclusions among these classes. 
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2 DEFINITIONS AND NOTATION 



2 Definitions and Notation 

In PQ, Cover introduced the notion of a broadcast channel through which one sender transmits 
information to two or more receivers. For the purpose of this paper we focus our attention on 
broadcast channels with precisely two receivers. 

A broadcast channel (BC) consists of an input alphabet X and output alphabets and y 2 
and a probability transition function p(yi, y 2 \x). A {{2 nRl , 2 nR ' 2 ), n) code for a broadcast channel 
consists of an encoder 

X n :WixW 2 ^ X n , 

and two decoders 

Wi : -> Wi 

w 2 : y 2 -» m, 

where Wi = {1, 2, 2 nRl }, W 2 = {1, 2, 2 ni?2 }. 

(n) 

The probability of error Pe is defined to be the probability that the decoded message is not 
equal to the transmitted message, i.e., 

p(») = P (jWi(Yi) ± Wi} U {W 2 {Y 2 n ) ± W 2 }) 

where the message pair (Wi, W 2 ) is assumed to be uniformly distributed over Wi x W 2 . 

A rate pair (Ri, R 2 ) is said to be achievable for the broadcast channel if there exists a sequence 
of ((2 nRl ,2 nR2 ),n) codes with -> 0. The capacity region of the broadcast channel with is 
the closure of the set of achievable rates. The capacity region of the two user discrete memoryless 
channel is unknown. 

The capacity region is known for lots of special cases such as degraded, less noisy, more capable, 
deterministic, semi-deterministic, etc. - see [6] and the references therein. In this paper we establish 
the capacity region for two more classes of broadcast channels, one where one receiver is essentially 
less noisy compared to the other receiver; and the other where one receiver is essentially more 
capable than the other receiver. 

A channel X — > Y 2 is said to be a degraded version of the channel X — ► Y\ if X — ► Y\ — » Y 2 ' i s 
a Markov chain and the pair (X, Y 2 ) is identically distributed as the pair (X,Y 2 ). A receiver Y± is 
said to be less noisy^ compared to Y 2 if 

I(U;Y 2 ) <I(U;Yi) 

for all p(u,x) such that U — > X — > (Y\,Y 2 ) forms a Markov chain. Finally, a receiver Y\ is said to 
be more capable^ compared to Y 2 if 

I(X;Y 2 ) <I{X;Y X ) 

for all p(x). 

Definition 1. A class of distributions V = {p(x)} on the input alphabet X is said to be a sufficient 
class of distributions for a 2-receiver broadcast channel if the following holds: Given any triple of 
random variables (U, V, X) distributee^ according to p(u, v, x), there exists a distribution q(u, v, x) 

1 In all cases we assume that the tuple (U, V, X, Y\, Y2) satisfies (U, V) — > X — > (Yi, Y2) forms a Markov chain. In 
a discrete memoryless broadcast channel with no feedback this assumption is "automatically" satisfied. However it 
is necessary to state it explicitly to prevent choices like U — Yi (except when X — > Y\ is deterministic) and other 
strange choices. 
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3 THE CAPACITY REGION OF A BROADCAST CHANNEL WITH AN ESSENTIALLY 

LESS NOISY RECEIVER 



that satisfies 



q(x) € V, 



I(U;Yi) p < I(U;Yi) q , i = 1,2, 
I(V;Yi) p < I(V;Yi) q , i = 1,2, 
I(X;Yi\U) p < I(X;Yi\U) q , i = 
I(X;Yi\V) p < I(X;Yi\V) q , i = 
I(X;Y) p <I(X;Y) q , * = 1,2. 



= 1,2 



= 1,2 



(1) 



The notation I(U;Yi) p denotes the mutual information between U and Y\ when the input is 
generated using p(u, v, x). 

Definition 2. A receiver Y\ is essentially less noisy compared to receiver Y 2 if there exists a 
sufficient class of distributions V such that whenever p(x) S V, for all U — > X — > (Y\, Y2) we have 



Remark 2. Setting V to be the entire set of distributions p(x) shows that a less noisy receiver is 
in particular an essentially less noisy receiver. However, in Section |4] we will show that there are 
essentially less noisy receivers that are not less noisy. 

Definition 3. A receiver Y\ is essentially more capable compared to receiver Y2 if there exists a 
sufficient class of distributions V such that whenever p(x) E V, for all U — ► X — ► (Y±, Y2) we have 



Remark 3. Clearly the above condition holds when Y\ is a more capable receiver than Y±, since it 
holds under each conditioning of U. Thus by setting V to be entire set of distributions on X ; if Y± 
is also a more-capable receiver than it is also an essentially more capable receiver. 

3 The capacity region of a broadcast channel with an essentially less noisy 
receiver 

Theorem 1. The capacity region of a two-receiver broadcast channel where Y\ is essentially less 
noisy compared to Y2 is given by the union of rate pairs (i?i,i?2) such that 



for some U — * X — > (Yi, Y2) and p(x) € V . Here V denotes any sufficient class of distributions that 
makes the receiver Y\ essentially less noisy compared to receiver Yi- 

Proof. The theorem follows in a straightforward manner from the known achievability regions and 
outer bounds for the two receiver broadcast channels, as shown below. 



I(U;Y 2 ) < HUM). 



I{X;Y 2 \U) < I(X;Yx\U). 



R2<I{U;Y 2 ) 
Rx + R 2 <I(U;Y 2 ) + I(X;Y 1 \U) 
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4 A CLASS OF SYMMETRIC BROADCAST CHANNELS WITH AN ESSENTIALLY LESS 

NOISY RECEIVER 



The direct part 



It is well-known [7] that the set of all rate pairs (R\,R 2 ) satisfying 



R 2 <I(U;Y 2 ) 
Ri+R 2 <I(U;Y 2 ) + I(X;Y 1 \U) 
R l + R 2 <I(X;Y 1 ) 



(2) 



for any U — > X — > (Y\,Y 2 ) is achievable via superposition coding. Further if p(x) S V, we have 
I(U;Y 2 ) < I(U;Yi) and thus ([2]) reduces to the region in Theorem Q] and completes the proof of 
the achievability. 

The converse part 

It is well-known [SIE] that the set of all rate pairs (R±,R 2 ) satisfying 



over all U — > X — > (Y%, Y 2 ) forms an outer bound to the capacity region of the broadcast channel. 
Clearly from the definition of the sufficient class V it is clear that one can restrict the union to be 
over p(x) € V. Further if p{x) £ V, we have I(U; Y 2 ) < I(U; Y%) and thus 



This implies that ([3]) reduces to the region in Theorem [T] and completes the proof of the converse 
to the capacity region. 

4 A class of symmetric broadcast channels with an essentially less noisy receiver 

In this section we prove that the class with an essentially less noisy receiver of channels is strictly 
larger than the class where where one receiver is less noisy [1] compared to the other receiver. In 
particular this class contains the channel that served as the motivation behind this paper - the 
broadcast channel when one of the channels is BSC(p) and the other is BEC(e); where the pair 
(p, e) satisfies 1 — H(p) > 1 — e. 

Definition 4. A channel with input alphabet X [X = {0, 1, ...m — 1}), output alphabet y (of size 
n) is said to be c-symmetric if, for each j = 0, .., m — 1, there is a permutation 7r,-(-) of 3^ such that 
P(Y = TTj(y)\X = (i + j) m ) = P(Y = y\X = i), Vi, where (i + j) rn = (i + j) mod m. 

Observe that BSC and BEC are examples of c-symmetric channels. 

A broadcast channel with input alphabet X and output alphabets 3^1,3^2 is said to be c- 
symmetric if both the channels X — > Y\ and X — ► Yi are c-symmetric. 

Lemma 1. The uniform distribution on X forms a sufficient class V for a c-symmetric broadcast 
channel. 



R2<I(U-Y 2 ) 
Ri+R 2 <I(U;Y 2 ) + I(X;Y l \U) 

R 1 </(x ; yi) 



(3) 



I{U;Y 2 ) + IiX-Y^U) <I{X;Yt). 
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4 A CLASS OF SYMMETRIC BROADCAST CHANNELS WITH AN ESSENTIALLY LESS 

NOISY RECEIVER 



Proof. Let X = {0,1, ...,m - 1}. Given a triple (U,V,X) construct a tuple (W',U',V',X') as 
follows: 

p(W' = j, U' = u, V = v, X' = i) 

= -P(U = u,V = v,X = (i + j) m ). 
m 

Further set (W', U' , V) — > X' —> (Y{, Y 2 ) to be a Markov chain with p(y[, y' 2 \x') = p(yi, 2/2 |z) (i.e. 
the channel transition probability remains the same). 
Observe that 

p(w' = j, u' = u,Y( = y) 

= ]T P(W' = j, U' = u,X' = i, Y{ = y) 

i 

= -TP(U = u,X = (i + j) m , Y 1 = 7Vj(y)) (4) 

i 

= -P(U = u,Y 1 =7r j (y)). 
m 

Similarly 

P(W = j, U' = u, Y{ = y) = -P(U = u,Y 2 = a^y)). (5) 

m 

It is easy to see that the following holds: 

P(X' = i) = — Vi 
m 

I(X'; Y>\W' = j) = I(X- Yi) Vj, i = l,2 
I(U'; Y(\W' = j) = I(U; Y t ) Vj, i = 1, 2 
I(X'; Yj\U', W = j) = I(X; Y t \U), Vj, i = 1, 2 

where all equalities (except the first one) follow from equations dU), ©, and that entropy is un- 
changed by relabeling. Similar conditions also holds for the pair (W',V). 

Therefore setting U = (W' ,U'),V = (W',V') and q(u,v,x) to be the distribution induced 
by (U, V, X') it is easy to see that the inequalities ([1]) are satisfied. As P(X' = i) = — Vi this 
establishes the sufficiency of the uniform distribution. □ 

Definition 5. In a c-symmetric broadcast channel Y\ is said to be a dominantly c-symmetric 
receiver if the following condition holds: for every p(x) 

I{X- Y y ) p - I(X; Y 2 ) p < I{X; Yi) u - I(X; Y 2 ) u , 

where u(x) is the uniform distribution. 

In other words, uniform distribution also maximizes the difference I(X;Yi) — I(X; Y 2 ). 

Claim 1. For the BSC(p), BEC(e) broadcast channel Y\ is a dominantly c-symmetric receiver 
when 1 — H{p) > 1 — e. 

The proof follows from part 3 of Claim [3] in the appendix; also see Figure El 
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5 THE CAPACITY REGION OF A BROADCAST CHANNEL WITH AN ESSENTIALLY 

MORE CAPABLE RECEIVER 



Lemma 2. In a c-symmetric broadcast channel, if Y\ is a dominantly c- symmetric receiver then 
Y\ is also an essentially less noisy receiver. 

Proof. Since the uniform distribution on X forms a sufficient class; by Lemma [1] it suffices to show 
that for all (V, X) such that p(x) is uniform we have 

I(V\Y X )>I(V;Y 2 ). (6) 

Given a pair (V, X) let p v (x) be the distribution on X when V = v. Y\ is a dominantly c-symmetric 
receiver implies 

I(X- Yx) Pv - I(X; Y 2 ) Pv < I(X- Y x ) u - I(X; Y 2 ) u . 

Therefore 

IiX-Y^V) - I{X;Y 2 \V) 

= J2?(V = v) (I{X-Y{) Pv - I(X;Y 2 ) Pv ) 

V 

<J2P(V = v) (I(X; Y,) u - I(X; Y 2 ) u ) 

V 

= I(X;Y 1 ) U -I(X;Y 2 ) U . (7) 

Since V — > X — > (Yi,Y 2 ) is Markov and p(x) is uniform, observe 

IiX-^V) - I{X-Y 2 \V) 

= I(X; Y x ) u - I(V; Y x ) - (I(X; Y 2 ) u - I(V; Y 2 )) 

= I(X; Y x ) u - I(X; Y 2 ) u - (I(V; Yi) - I(V; Y 2 )) . (8) 
The required inequality ((6|) follows from ([7|) and ([8]) respectively. 

5 The capacity region of a broadcast channel with an essentially more capable 
receiver 

Theorem 2. The capacity region of a two-receiver broadcast channel where Y\ is essentially more 
capable compared to Y 2 is given by the union of rate pairs (R\,R 2 ) such that 

R2<I(U;Y 2 ) 
Ri + R 2 <I{U-Y 2 ) + I{X-Y l \U) 
Ri + R 2 <I(X;Y 1 ) 

for some U — * X — > (Yi, Y 2 ) and p(x) € V . Here V denotes any sufficient class of distributions that 
makes the receiver Y\ essentially more capable compared to receiver Y 2 . 

The direct part 

It is well-known [7] that the set of all rate pairs (R±,R 2 ) satisfying 

R2 <I(U; Y 2 ) 

Ri + R 2 <I(U;Y 2 ) + I(X;Y 1 \U) (9) 
Ri + R 2 <I(X;Y 1 ) 

for any U — > X — > (Yi,Y 2 ) is achievable via superposition coding. Restricting ourselves to p(x) £ V, 
we have the region in Theorem [TJ and completes the proof of the achievability. 
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6 ON INCLUSION RELATIONSHIPS BETWEEN CLASSES OF BROADCAST CHANNELS 



The converse part 

It is well-known [3[5] that the set of all rate pairs (R±,R 2 ) satisfying 

R 2 <I(U;Y 2 ) 

Ri + R 2 <I(U;Y 2 ) + I{X;Y 1 \U) (10) 

R 1 <I(V;Y 1 ) 
Ri + R 2 <I(V;Y 1 ) + I(X;Y 2 \V) 

over all (U,V) — ► X — ► (Yi,Y 2 ) forms an outer bound to the capacity region of the broadcast 
channel. Clearly from the definition of the sufficient class V it is clear that one can restrict the 
union to be over p(x) € V . Further if p(x) € V, since Y\ is an essentially more capable receiver we 
have I(X;Y 2 \V) < l{x;Y x \V) and thus 

/(y ; y 1 ) + 7(X;r 2 |F)</(x ; y 1 ). 

This implies that (|10p is contained inside the (achievable) region in Theorem Q] and completes the 
proof of the converse to the capacity region. (Indeed it is easy to see that setting V = X is optimal 
and thus reduces the region in (|10p to the region in Theorem [2j) Q 

6 On inclusion relationships between classes of broadcast channels 

In this section, we present the various relationships between the classes of 2-receiver broadcast 
channels that were discussed in the paper. 




Figure 3: The classes of broadcast channels with a superior receiver. I - degraded, iY - less noisy, 
III- essentially less noisy, IV - more capable, V - essentially more capable 

Claim 2. We claim that the following relationships, as shown in Figured hold 

(i) Degraded C less noisy C more capable, 
(ii) less noisy C essentially less noisy, 
(Hi) essentially less noisy ^ more capable, 
(iv) essentially less noisy =£• essentially more capable, 

(v) more capable =£• essentially less noisy. 
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(vi) more capable C essentially more capable, 

Proof. Part (i) was established in [1]. Part (ii) follows from Remark [2] and Sectional Part (Hi) 
follows from Figure [2] and Section [H 

Part (iv): This is again easy to deduce from Figure EJ Take U to be a binary random variable 
with P(U = 0) = \ and take P(X = 0\U = 0) = e, P(X = 0\U = 1) = 1 - e. For sufficiently small 
e we have that I(X;Y2\U) > I(X;Y\\U) and hence Y\ is not an essentially more capable receiver 
than Y 2 . Note that p(x) is uniform and hence any sufficient class V must contain the uniform 
distribution. 

Part (v): Let X -»■ YI be BEC(0.5) and X ^ Y 2 be BSC(O.llOl). Observe that 0.5 = 1 - e > 
1 — H (p) ~ 0.4998, and from part 2 of Claim [3] in the Appendix we can see that Y± is a more 
capable receiver than Y 2 . Let J7 — > X be BSC(0.05), and set P(U = 0) = 0.5. This implies 
P(X = 0) = 0.5 G V and it is easy to see that 0.3568 « I(£7;Y~i) < I(Z7; Y~ 2 ) ~ 0.3924 and thus Y x 
is not an essentially less noisy receiver than Y%. (Note that this also implies that essentially more 
capable =£> essentially less noisy.) 

Part (vi): From Remark [3] it is clear that more capable C essentially more capable. Hence it 
suffices to prove that essentially more capable =£> more capable. To this end, consider the following 
channel. The alphabets are given by X = {0, l,2,3},3^i = 3^2 = {0, 1}. The channel X — > Y\ is a 
perfectly clean channel when X € {0, 1}, and is the completely noisy BSC(0.5) when X € {2,3}. 
The channel X -> Y 2 is a BSC(O.l) when X e {0, 1} and BSC(0.4) when X & {2,3}. When p(x) 
is uniform on X = {2, 3} we have I(X; Y 2 ) > I(X; Y\); implying Y\ is not a more capable receiver 
than Y 2 . However it is easy to show that p(x) uniform on X = {0, 1} forms a sufficient class, and 
clearly on this sufficient class Y\ is a more capable receiver than Y 2 . This example shows that there 
are essentially more capable receivers that need not be more capable. □ 
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Consider a broadcast channel with two receivers. Let X — ► Y\ beBSC(p), < p < i and 
X -> Y 2 be BEC(e). Let 

D{x)=I(X;Y l )-I(X;Y 2 ) 

= H(x * p) - (1 - e)H(x) - H{p) 

be the difference I{X;Y\) — /(A; Y2) conditioned on P(A = 0) = x. Observe that the function is 
symmetric about x = \, i.e. D(x) = D(l — x). 

Claim 3. The function D(x) has the following properties: 

1. When e < 2p, D(x) monotonically decreases in the interval [0, ^]. 

2. When 2p < e < H(p), D(x) monotonically decreases in the interval [0,r], and monotonically 
increases in the interval [r, |] for some r G (0, |]. The maximum occurs at x = 0, i.e. 
D(x) < 0,Vx G [0,1]. 

3. When H(p) < e < 1, D{x) monotonically decreases in the interval [0, r], and monotonically 
increases in the interval [r,^] for some r G (0,^). The maximum occurs at x = h, i.e. 
D(x) < D(\),\/x G [0,1]. 

Proof. Let J(x) = log 2 ^jp. Observe that 

^-D{x) = (1 - 2p) J(x *p)-(l- e)J(x). (11) 

For x G [0, |], when e < 2p we have £D(x) < 0, since < J(x *p) < J(x) and establishes Part 1. 
From (fTTj) any x such that -^D(x) = must satisfy 

1 - : 



+ 1 =x(l-p)+p(l-x), (12) 



where c = ■ Define 



When e > 2p, we have < c < 1. Then it is easy to see that L{x) is concave in x G [0, Observe 
that 



dx L ^ ( x (i _ s)) 1-c [(l - x) c + x c )] 2 x l - c 

and since the functions: (x(l — x)) c , (1 — x) c + x c , x 1_c increase in x G [0, ^], we have -^L(x) < 0. 
This implies that L(x) can intersect the line R(x) = x(l — p) + p(l — x) at possibly no more than 
two points on x G [0, ^]. Since L(^) = ^ = -R(^), there is at most one other solution r G (0, ^) to 
CED when x G (0, ±). 

Since -^D(x) — > — oo as x — > + , it is clear that -D(x) decreases in [0,r] and increases in [r, i]. 
The maximum can therefore be obtained by comparing D(0) = and £>(§) = e - This 
establishes Parts 2,3. □ 



10 



REFERENCES 



REFERENCES 



Claim 4. Consider a broadcast channel with two receivers. Let X ->Yi be BSC(p), < p < \ and 
X — > Y 2 be BEC(e). Then the following holds: 

1. < e < 2p: Y\ is a degraded version ofY 2 . 

2. 2p < e < 4p(l — p): Y 2 is less noisy than Y\, but is not a degraded version. 

3. 4p(l — p) < e < H(p): Y% is more capable (but not less noisy) than Y2. 

4- H{p) < e < 1: Y\ is essentially less noisy than Y2. 

Proof. Part 1 is well-known and easy to establish. Part 4 follows from Claim [3] and Section [U We 
also know from Parts 1 and 2 of Claim [3] that when < e < H (p) D{x) < 0, i.e. Y2 is a more 
capable receiver than Y\ . Therefore to complete the proof of the claim it suffices to show that I2 
is a less noisy receiver than Y\ if and only if0<e< 4p(l — p). 
The following statements are equivalent (the proof is immediate and omitted) : 

(i) Y2 is a less noisy receiver than Y±; 

(ii) W^X^{Y U Y 2 ), I(X; Yi\U) - I(X; Y 2 \U) > I(X; - I(X; Y 2 ); 
{Hi) I(X;Yi) p r x ) — I(X;Y 2 ) P ( X ) is a convex function of p(x). 

Therefore Y 2 is a less noisy receiver than Y\ if and only if D{x) is convex for x S [0, 1]. It is 
straight forward to see that D(x) is convex if and only if0<e< 4p(l — p). □ 
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